Speaker recognition in two-wire test sessions

Authors

Hagai Aronowitz

Yosef A. Solewicz

Abstract

This paper deals with the task of speaker recognition in four-wire training and two-wire testing conditions. Instead of performing blind speaker diarization before the recognition stage, we perform recognition directly on the non-segmented (or imperfectly diarized) speech. We present an analysis of the problem with respect to three different speaker recognition systems and propose improved recognition techniques in both the frame domain and the model domain. The proposed techniques reduce the error rate significantly. Furthermore, the developed techniques may also be beneficial in conjunction with an imperfect blind diarization stage.

Related resources

Two-wire nuisance attribute projection

This paper addresses the task of nuisance reduction in two-wire speaker recognition applications. Besides channel mismatch, two-wire conversations are contaminated by extraneous speakers, which represent an additional source of noise in the supervector domain. It is shown that two-wire nuisance manifests itself as undesirable directions in the interspeaker subspace. For this purpose, we derive tw...

Implicit Segmentation in Two-Wire Speaker Recognition

This paper presents a novel self-contained two-wire speaker recognition framework. The classical approach to two-wire speaker recognition usually requires a preliminary explicit speaker segmentation stage in order to extract audio files for the two hypothesized speakers. We propose an implicit speaker segmentation method implemented at the supervector level of speaker recognition systems. By pe...

Voice mining with multiple target speakers

In the basic speaker verification task, an unknown voice segment that contains the voice of a single speaker is checked against the acoustic model of a single target speaker. In the multiple-speaker voice mining application, a large set of audio sessions is searched for the sessions of several target speakers. Each of the audio sessions may hold the voice of more than one speaker. This applicat...

Speaker recognition using kernel-PCA and intersession variability modeling

This paper presents a new method for text-independent speaker recognition. We embed both training and test sessions into a session space. The session space is a direct sum of a common-speaker subspace and a speaker-unique subspace. The common-speaker subspace is Euclidean and is spanned by a set of reference sessions. Kernel-PCA is used to explicitly embed sessions into the common-speaker subsp...

A new procedure for classifying speakers in speaker verification systems

In this paper we propose a new measure to classify speakers with respect to their behaviour in speaker recognition systems. Taking the proposal made by EAGLES as a point of departure, we show that it fails to yield results that are consistent between closely related speaker recognition methods and between different amounts of speech available for the recognition task. We show that measures based...

Publication date: 2008
